Feature engineering for MEDLINE citation categorization with MeSH



Similar resources

Multilabel associative classification categorization of MEDLINE articles into MeSH keywords.

The specific characteristic of classification of medical documents from the MEDLINE database is that each document is assigned to more than one category, which requires a system for multilabel classification. Another major challenge was to develop a scalable method capable of dealing with hundreds of thousands of documents. We proposed a novel system for automated classification of MEDLINE docum...


A MEDLINE categorization algorithm

BACKGROUND Categorization is designed to enhance resource description by organizing content description so that the reader can quickly and easily grasp the main topics discussed in it. The objective of this work is to propose a categorization algorithm to classify a set of scientific articles indexed with the MeSH thesaurus, and in particular those of the MEDLINE bibliographic d...


An Incremental Approach for MEDLINE MeSH Indexing

As an increasing number of new journal articles are added to the MEDLINE database each year, it becomes imperative to build effective systems that can automatically suggest Medical Subject Headings (MeSH) to reduce effort from human annotators. In this paper, we propose three approaches, one building upon another in an incremental way, to automatic MeSH term suggestion: 1) MetaMap-based label...


Clustering Citation Distributions for Semantic Categorization and Citation Prediction

In this paper we present i) an approach for clustering authors according to their citation distributions and ii) an ontology, the Bibliometric Data Ontology, for supporting the formal representation of such clusters. This method allows the formulation of queries which take into consideration the citation behaviour of an author and predicts with a good level of accuracy future citation behaviours....


QueryCat: automatic categorization of MEDLINE queries

A searcher's inability to formulate an appropriate query can result in an overwhelming number of retrieved documents. Our approach to this problem is to use information about common types or categories of queries to (1) reformulate the user's initial query and (2) create an informative organization of the retrieved documents from the reformulated query. To achieve these goals, we first must ide...



Journal

Journal title: BMC Bioinformatics

Year: 2015

ISSN: 1471-2105

DOI: 10.1186/s12859-015-0539-7